AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Large Model Inference

# Large Model Inference

Medgemma 27b Text It 4bit
Other
MedGemma-27B-Text-IT-4bit is an MLX-format model converted from Google's MedGemma-27B-Text-IT model, specifically optimized for medical and clinical reasoning tasks.
Large Language Model
M
mlx-community
193
3
Parakeet Tdt 0.6b V2 Onnx
NVIDIA Parakeet TDT 0.6B V2 is a model based on automatic speech recognition (ASR) tasks, suitable for English speech-to-text tasks.
Speech Recognition English
P
istupakov
129
3
Rank1 32b
MIT
rank1-32b is an information retrieval reranking model based on Qwen2.5-32B, which judges relevance by generating reasoning chains
Large Language Model Transformers English
R
jhu-clsp
18
0
Meta Llama 3.3 70B Instruct AWQ INT4
Llama 3.3 70B Instruct AWQ INT4 is the 4-bit quantized version of the Meta Llama 3.3 70B Instruct model, optimized for multilingual dialogue use cases and text generation tasks.
Large Language Model Transformers Supports Multiple Languages
M
ibnzterrell
6,410
22
Llama 3 8B Instruct QServe G128
Llama 3 is the next-generation open-source large language model introduced by Meta, featuring enhanced performance and broader application scenarios.
Large Language Model Transformers
L
mit-han-lab
197
2
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase